Search for: All records

Creators/Authors contains: "Dragan, Anca D."

Total Resources: 9

Resource Type
Conference Paper: 5
Conference Proceeding: 0
Dataset: 0
Journal Article: 4
Workshop Report: 0

Availability
Full Text / Resource Available: 9
Citation Only: 0

Assisted Robust Reward Design

He, Jerry Zhi-Yang; Dragan, Anca D. (November 2021, Conference on Robot Learning)

Real-world robotic tasks require complex reward functions. When we define the problem the robot needs to solve, we pretend that a designer specifies this complex reward exactly, and it is set in stone from then on. In practice, however, reward design is an iterative process: the designer chooses a reward, eventually encounters an "edge-case" environment where the reward incentivizes the wrong behavior, revises the reward, and repeats. What would it mean to rethink robotics problems to formally account for this iterative nature of reward design? We propose that the robot not take the specified reward for granted, but rather have uncertainty about it, and account for the future design iterations as future evidence. We contribute an Assisted Reward Design method that speeds up the design process by anticipating and influencing this future evidence: rather than letting the designer eventually encounter failure cases and revise the reward then, the method actively exposes the designer to such environments during the development phase. We test this method in a simplified autonomous driving task and find that it more quickly improves the car's behavior in held-out environments by proposing environments that are "edge cases" for the current reward.
Agnostic Learning with Unknown Utilities

https://doi.org/10.4230/LIPIcs.ITCS.2021.55

Bhatia, Kush; Bartlett, Peter L.; Dragan, Anca D.; Steinhardt, Jacob (January 2021, Leibniz International Proceedings in Informatics)
Full Text Available
Physical interaction as communication: Learning robot objectives online from human corrections

https://doi.org/10.1177/02783649211050958

Losey, Dylan P.; Bajcsy, Andrea; O’Malley, Marcia K.; Dragan, Anca D. (October 2021, The International Journal of Robotics Research)

When a robot performs a task next to a human, physical interaction is inevitable: the human might push, pull, twist, or guide the robot. The state of the art treats these interactions as disturbances that the robot should reject or avoid. At best, these robots respond safely while the human interacts; but after the human lets go, these robots simply return to their original behavior. We recognize that physical human–robot interaction (pHRI) is often intentional: the human intervenes on purpose because the robot is not doing the task correctly. In this article, we argue that when pHRI is intentional it is also informative: the robot can leverage interactions to learn how it should complete the rest of its current task even after the person lets go. We formalize pHRI as a dynamical system, where the human has in mind an objective function they want the robot to optimize, but the robot does not get direct access to the parameters of this objective: they are internal to the human. Within our proposed framework, human interactions become observations about the true objective. We introduce approximations to learn from and respond to pHRI in real time. We recognize that not all human corrections are perfect: often users interact with the robot noisily, and so we improve the efficiency of robot learning from pHRI by reducing unintended learning. Finally, we conduct simulations and user studies on a robotic manipulator to compare our proposed approach with the state of the art. Our results indicate that learning from pHRI leads to better task performance and improved human satisfaction.

How to Be Helpful to Multiple People at Once

https://doi.org/10.1111/cogs.12841

Gates, Vael; Griffiths, Thomas L.; Dragan, Anca D. (June 2020, Cognitive Science)

Full Text Available
Quantifying Hypothesis Space Misspecification in Learning From Human–Robot Demonstrations and Physical Corrections

https://doi.org/10.1109/TRO.2020.2971415

Bobu, Andreea; Bajcsy, Andrea; Fisac, Jaime F.; Deglurkar, Sampada; Dragan, Anca D. (June 2020, IEEE Transactions on Robotics)

Full Text Available
LESS is More: Rethinking Probabilistic Models of Human Behavior

https://doi.org/10.1145/3319502.3374811

Bobu, Andreea; Scobee, Dexter R.; Fisac, Jaime F.; Sastry, S. Shankar; Dragan, Anca D. (March 2020, International Conference on Human-Robot Interaction (HRI))

Full Text Available
Enabling robots to communicate their objectives

https://doi.org/10.1007/s10514-018-9771-0

Huang, Sandy H.; Held, David; Abbeel, Pieter; Dragan, Anca D. (February 2019, Autonomous Robots)

Full Text Available
Human-AI Learning Performance in Multi-Armed Bandits

https://doi.org/10.1145/3306618.3314245

Pandya, Ravi; Huang, Sandy H.; Hadfield-Menell, Dylan; Dragan, Anca D. (January 2019, Artificial Intelligence, Ethics and Society (AIES))

Full Text Available
Establishing Appropriate Trust via Critical States

https://doi.org/10.1109/IROS.2018.8593649

Huang, Sandy H.; Bhatia, Kush; Abbeel, Pieter; Dragan, Anca D. (October 2018, IROS)

Full Text Available